THE DISTRIBUTION OF SHORT CHARACTER SUMS 



YOUNESS LAMZOURI 



Abstract. Let x be a non-real Dirichlet character modulo a prime q. In this paper 
we prove that the distribution of the short character sum S x ,h{x) — J2x<n<x+H x( n )> 
as x runs over the positive integers below q, converges to a two-dimensional Gaussian 
distribution on the complex plane, provided that log H — o(log q) and H — > oo as 
q — > oo. Furthermore, we use a method of Selberg to give an upper bound on the rate 
of convergence. 



1. Introduction 

Understanding the behavior of character sums is one of the central problems in 
analytic number theory. Let \ be a non-principal Dirichlet character modulo a large 
prime q, and define the short character sum 

S x,h(x) := *( n )> 

x<n<x+H 

where H = H{q) < q. When \ = (jj) is the Legendre symbol modulo q, Davenport 
and Erdos [2] studied the distribution of S Xi h(x) as x runs over the positive integers 
below q, and proved that it tends to a normal distribution of mean zero and variance 
H, if logH = o(logg) and H — > oo as q — > oo (this restriction on H makes it possible 
to compute the moments of S x ^h{%))- More precisely they showed that in this range 



1.1) - 



< x < q- 1 : ~ x ^> < \ 



S*, H (x) .1 1 ' A 



x2 / 2 dx, 



— oo 



2tt 

as q, H — > oo. However, their method does not give information on the rate of this 
convergence. 

An analogous result to (1.1) in the case of sums of the Mobius function in short in- 
tervals have been obtained by Ng [5] conditionally on a variant of the Hardy-Littlewood 
prime ^-tuples conjecture adapted to the case of the Mobius function. His result states 
that, under this assumption, the distribution of the sum J2 x<n<x+H M n ) as x varies 
over the integers below iV converges to a Gaussian of mean and variance 6H/tt 2 , if 
log if = o(logiV) and H — > oo as iV — > oo. 
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Recently in |3], Mak and Zaharescu generalized Davenport and Erdos approach by 
investigating the distribution of more general short exponential sums. Their results 
imply that for any non-real character \ modulo q, both ReS X: n(x) and Itd.S X) h(x) have 
a limiting Gaussian distribution. However, their method does not seem to handle the 
joint distribution of ReS Xj n(x) and ImS^ ^x). 

In this paper we extend both the results of Davenport-Erdos and Mak-Zaharescu, 
namely by proving that S Xt n(x) has a two-dimensional Gaussian distribution on the 
complex plane. Our approach relies on a method of Selberg originally applied to prove 
log- normality for the values of ((1/2 + it) (see Tsang's thesis [6] and Selberg [7]), where 
((s) is the Riemann zeta function. 

Theorem 1. Let x be a non-real character modulo a large prime q, and let 71 be a 
closed rectangle in the complex plane with edges parallel to the axes. If log H = o(log q) 
and H — > oo as q — > oo then 

1 ax r < (i - i : . ; g r \ = 

2tt 



O<x<q -l:^0l en 
~ ~ q y/H/2 



1 ft / x 2 + y 2s 

exp I I dxdy 



7v 



+ o(,^) + 1 )(^ + v ^f 

where stands for the two-dimensional Lebesgue measure. 

We also note that our method furnishes the same bound (as in Theorem 1) for the 
rate of convergence of the distribution of S Xj h(x) /vH to the standard Gaussian in 
(1.1), in the case where x is the Legendre symbol modulo q. Moreover, it is possible 
to obtain an analogous result to Theorem 1 for more general short exponential sums if 
one combines our approach with the work of Mak and Zaharescu in [I]. 

Remark that if x is non-real we need to normalize S Xi h(x) by \/ H/2 while if x is 
real then the normalization factor is \/H (see (1.1)). Indeed, one can show that in the 
latter case 

~ ]T S XjH (x) = o(l) and - J2 \S x ,h(*)\ 2 = (1 + o{l))H, 

V x=0 Q x=0 

so that H is the variance of S Xj h{%) in this case. On the other hand, if x is non-real 
then it follows from Proposition 2.2 below that 

9-1 /i \ 1 9-1 



- \*eS x Mx)\ 2 =(l + o(l)) - l^(^)! 2 - f 

^ x=0 ^ ' ^ x=0 



and that the same estimate holds for haS X) H(x). 

In analogy with multiplicative functions, Chatterjee and Soundararajan [1] studied 
the distribution of random multiplicative functions X(n) in short intervals. In this 
probabilistic model, the values of the multiplicative function at the primes X(p) are 
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independent random variables which take the values —1 and 1 with probability 1/2, 
and the values at all natural numbers are built out of the values at primes by the 
multiplicative property. Using Stein's method for normal approximation they proved 
that J2n<h<n+h X(n) is approximately Gaussian if iV 1//5 logiV <C H = o(N/ log N). 
This random model was originally introduced in order to understand the distribution 
of partial sums of the Mobius function, which explains the restriction on the lower 
bound of H (this is connected to the problem of the existence of square-free integers 
in short intervals). In our case, we are concerned with Dirichlet characters, and so the 
corresponding probabilistic model involves random completely multiplicative functions. 
Therefore in this case, Chatterjee and Soundararajan result should hold when H = 
o(N/ log N) and H — » oo as iV — > oo. Motivated by this argument we formulate the 
following conjecture: 

Conjecture 1. Let x be a non-principal character modulo a prime q. Suppose that 
H = H(q) = o(q/ log q) and H — » oo as q — > oo. Then, when x is real, the distribution 
of Sx ^jj X ^ for < x < q — 1 tends to a Gaussian with mean and variance 1. More- 
over, when x is non-real then S ^q^ has a limiting two-dimensional standard Gaussian 
distribution on the complex plane. 

2. Moments of short character sums 

Let x be a non-real character modulo q and 1 < H < q be a positive integer. In order 
to investigate the joint distribution of ReS X: n(x) and lmS Xt H(x) we shall compute the 
moments 

1 9-1 

M(r, s):=-J2 (ReS XtH (x)) r (lmS X!H (x)) s , 

for non-negative integers r, s. Mak and Zaharescu jl] had previously computed the 
moments of ReS Xt n(x) (and also those of 1uiS X! h(x)) and proved that they are close to 
the moments of a Gaussian. In our case applying their method leads to weaker error 
terms. Instead, our strategy is to introduce a more adequate probabilistic model for 
the character sum S x> h(x). Let Xi, . . . , X H be independent random variables uniformly 
distributed on the unit circle, and define 

Zh '■= Xi + • • • + Xh- 

Using the Weil bound for character sums (see Weil [8]) we will prove that the moments 
M(r, s) are very close to the corresponding moments of the random variable Zh, that is 
to say the expectation W,((ReZ H ) r (ImZ H ) s ^ . (Throughout the paper E(-) stands for the 
expectation of the random variable in brackets). First we require the following lemma. 

Lemma 2.1. Let r, s be non-negative integers. If r + s is odd then 

E((ReZ H ) r (lmZ H y) = 0. 
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On the other hand, if r + s = 2m is even then 



E((ReZ H Y(lmZ H y) = B m (H) £ * (') (f) (-1)-* 

0<j<r,0<k<s ^ ' V J / V / 
j+k=m 

where B m (H) is the number of positive integers 1 < y 1; . . . , y m , . . . , z m < H such 
that {yi, . . . ,y m } = {zi, . . . ,z m }. 

Proof. We have 

Z// + Z H \ ( Z H — Zn 



E((ReZ H ) r (lmZ H ) s ) = E 



2i 



(2.1) =E ^2-Qzi r zT- i ^(2i)-*Qz*(-Z ff ) 

Moreover, notice that 



a<n<H / \Km<H 



l<n\,...,n^<H l<mi,...,mi<H 



Hence, we find 



E (z k H Z H l ) 



—^_\B k {H) \ik = l 

I otherwise. 

The result follows upon inserting this estimate in (2.1). □ 
Using this lemma we establish the following result: 

Proposition 2.2. For any non-negative integers r, s such that H r+S < g 1 / 2 we have 
M(r, s) = E((ReZ H ) r (Im^) s ) + O (H r+S {r + s)q- 1/2 ) . 

Proof. First, similarly to (2.1) we derive 
(2.2) 



M(r *\--X^\ S ^ H ^ ± S xM X ) \ f S X , H (X) ~ S x , H (x) 

= lzZzZ 2 ~ r ( •) s x M%Mxf~ j J>r (l) s x , H (x) k (-s^)) 
y x =o j=o fc=o ^ ' 

-§£^C)fi) ( - ir4/ f + *- r+ - tf+ *»' 
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where 



Furthermore, we have 



J(M) = - Y s xM x ) k s x M x ) 1 - 



x=0 



'<*.') = E 

x=0 \x<n<a;4 

= ;E E 



) ( Y ^ m )\ 

J \x<m<x+H / 



^2 x{ni---n k )x{mx- ■■mi). 

x=0 x<n\,...,n,k<x+H x<mi,...,mi<x+H 

Writing yj = rij — x for 1 < j < k and z t = m t — x for 1 < t < I, we deduce that 
(2.3) 

1 9-1 

H k J)= Y Y -Y x {( x+yi ">"'( x+y ^)x{( x+ z i)---( x + z i)\ 

l<yi,...,y k <H l<zi,...,zi<H 9 x=0 

We split the set of positive integers 1 < yx, . . . , y^, z\, . . . , Z\ < H into diagonal terms 
{?/!,..., y k } = {zi,...,Zi} (which will give the main contribution to I(k,l)), and 
off-diagonal terms {y±, . . . , y k } ^ {zi, . . . , z{\. To bound the contribution of the off- 
diagonal terms, we use the Weil bound for character sums (more precisely Corollary 
11.24 of Iwaniec-Kowalski [3]) which yields in this case 

9-1 

^x(i x + yi)---( x + yk)jx({x + zt) ■ ■ ■ (x + z{yj 

x=0 



(2.4) 



On the other hand, if {yi, . . . , yk} = {z\, . . . ,zi} then k = I and 

9-1 

(2.5) J2x{i x + yi)---( x + yk))x({x + z 1 )---{x + zi)^ =q + 0{k), 



x=0 



since x( x + y) = if x = —y mod q. 

Therefore, in the case where k I, all the terms in the sum on the RHS of (2.3) are 
off-diagonal, which in view of (2.4) gives 

\I(k,l)\ < (k + l)H k+l q- 1/2 . 

On the other hand, if k — I then using (2.4) and (2.5) we get 

I(k, I) = B k (H) fl + of^j+O {kH 2k q- 1 ' 2 ) = B k (H) + O (kH 2k q^ 2 ) , 

since Bk(H) < H 2k . Thus, inserting these estimates in (2.2) and appealing to Lemma 
2.1 we get 

M(r, s) = E((ReZ H ) r (lmZ H ) s ) + O (H r+S (r + s)<T 1/2 ) , 
as desired. □ 
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3. An asymptotic formula for the two-dimensional characteristic 

FUNCTION OF S X:H (x) 

To lighten the notation, we shall define the normalized short character sum by 



S x {x) 



Let $ x be the characteristic function of the joint distribution of ReS x and \mS x , which 
is defined by 

$ x (m, v) — — exp (iuReS x {x) + ivlmS x (x)j . 



The purpose of this section is to establish the following theorem, which shows 
that $ x (u,v) is very close to the characteristic function of a two-dimensional stan- 
dard Gaussian distribution. This will be the main ingredient of the proof of Theorem 
1. 

Theorem 3.1. Let q be a large prime and N be a positive integer such that N < 
log q/ (20 log H). Then, for any real numbers u,v such that \u\, \v\ < if 1 / 4 we have 

w ^ P (-^)( 1 + o(^!)) 

In order to prove this result, we shall first use Proposition 2.2 to show that $ x (u, v) 
is approximately equal to the characteristic function of the joint distribution of ReZn 
and lmZ H , where 

z- Zh 



Then, using that Zh — X\ + ■ ■ - + Xh, where the Xj are independent random variables 
uniformly distributed on the unit circle, we shall prove that the characteristic function 
of the joint distribution of ReZ H and lmZ H is close to the characteristic function of 
a two-dimensional standard Gaussian distribution in a wide range, if H is large. More 
precisely, we have 

Lemma 3.2. Let u,v be real numbers such that \u\, \v I < H 1 ^. Then 

E ( e ^ + ^ = exp (i + o (^)) • 

Proof. First, remark that 

E((ReX 1 ) r (ImX 1 ) s ) = / (cos #) r (sin 9) s d6 = I (smt) r (cost) s dt, 

27r J-* 27r J-K 
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by making the change of variable t = n/2 — 9 and using that sin 9 and cos 9 are periodic 
with period 2ir. Therefore since sin 9 is odd, then 

(3.1) E ((ReX!) r (ImXi) s )) = 0, if r is odd or s is odd. 
Furthermore, the independence of the Xj yields 

(3.2) E ( e »ReZ^imzn = E ( ( ^^ ReX + _^i mXl 

Now, using (3.1) along with the fact that e lx = 1 + ix — x 2 /2 + (ix) 3 /Q + 0(x 4 ) for all 
x G R, we deduce 

E I exp | -4^=Ii(>.V ; + -4I=Ii.i.Vi 



= l-^E((ReX 1 ) 2 )-^E((ImX 1 ) 2 )+o( 



u 4 + v 4 + u 2 v 2 

W 2 



u 2 + v 2 „ ( u 4 + v 4 



= 1 - — +0 



2iJ V # 2 

since -u 2 t> 2 < -u 4 + t> 4 . The result follows upon inserting this estimate in (3.2). □ 

Proof of Theorem 3.1. Let N be a positive integer such that H N < g 1 / 20 . Using the 
Taylor expansion 



e lx 



k=0 

we get 



g -i2jv-i2jv-i (iuReSJx)Y (iv1mSJx)Y 

0.3) *,(«,.)= EE E- — ^ — ^+*, 

y x=0 r=0 s=0 

where 

/ / _ \2JV / \ 2JV / \ 2JV \ 



1 9-1 

(3.4) y ^=o 



■uReS' x (x)) ( vImjS' x (x) J ( ReS' x (x)ImS , x (a;) ) 



+ T^TTTT^ + 



(27V)! (2N)\ (2N)\ 2 

u 2N 2 N v 2N 2 N (2uv) 2N 

« m2NY Mmo) + n*m\ hm2N) + ikiw Mmm) - 

Furthermore, combining Lemma 2.1 and Proposition 2.2, and noting that B^(H) < 
k\H k we derive 

(3.5) M(2r, 2s) < E ((ReZ H ) 2r (lmZ H ) 2s ) + q' 1 ' 4 < (r + s)!iF+ s , 

for any non-negative integers r, s with if r+s < q 1 / 10 . Hence, inserting this estimate in 
(3.4) and using that (N\) 2 < (2N)\ we deduce 

(2tiT + (2t;T (2 W )^ 
(3 ' 6) El AH + 72iV)T- 
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Next we compute the main term on the RHS of (3.3). Define 

q -i 2N-i2N-i ( i u ReS x (x)Y (ivhnS x [x) 



q * — ' * — ' * — ' r\s\ 

* x=0 r=0 s=0 
2N-12N-1 , y, y 

V V { ] { ' Mir s) 

r=0 s=0 v ' 1 



Then, appealing to Proposition 2.2 we obtain 

2N-1 2N-1 



r=0 s=0 

Moreover, the main term on the RHS of the last estimate equals 
1 (iuReZ H ) r \ ( r-^ 1 (ivlmZ H ) 



r=0 / \ s=0 



{2N)\ J J \ \ (2N)\ 



(2tx 2 ) jv + (2x; 2 ) Af (2uv) 2N 
N\ + (2N)\ 

which follows from (3.5). Thus, we infer from Lemma 3.2 that 

u 2 + v 2 \ ( „ f u A + v 4 



Jjv(m, = exp — 1 + 



H 



+0( (2„y : (2„r + (^g +g - 1/4(1+ ^ )(1 + 



2 AT' 



A^! (2iV)! 

Finally, combining this estimate with (3.3) and (3.6) completes the proof. □ 

4. The distribution of S x>h (x): Proof of Theorem 1 

In order to prove Theorem 1 we shall appeal to the following Lemma of Selberg 
(Lemma 4.1 of [BJ), which provides a smooth approximation for the signum function. 
Selberg used this lemma in his proof that log £(1/2 + zt) has a limiting two-dimensional 
Gaussian distribution (see [BJ and [7]). 

Lemma 4.1 (Lemma 4.1 of [6J). Let t > 0. Define 

2u 

G(u) = h 2(1 -w)wcot(Tra) for u e [0,1]. 

7T 

Then for all x 6l we have 

sgn(x) = fc H sin(2^x)^ + O (7 
Moreover, G(u) is differentiable and < G(m) < 2/tt for u G [0, 1]. 



Here and throughout we shall denote by l a>( g the indicator function of the interval 
[ol,0\. Observe that 

sgn(:r — a) — sgn(:r — j3) 



WW 



Moreover, define 



Then 



faA u ) :z 



(4-1) 1/^(^)1 = | 



2tt/3m 

e~ u dt 

2-wau 



< iru\f3 — a\. 



Furthermore, it follows from Lemma 4.1 that 
(4.2) 

i ( T \ _ t™ C n ( u \ piniux f ( , d u i n f sin 2 (TTt(x - a)) sin 2 (7rt(x - g)) \ 

Let 7?. = [a, b] x [c, rf] be a rectangle in the complex plan, and denote by 1^ the indicator 
function of 71. Moreover, let z = x + iy be a complex number. Then using the identity 

(4.3) Im(wi)Im(w 2 ) = ^Re(u>iW^ - WiW 2 ), 

we infer from (4.2) that 
(4-4) 

l n {z) =±Re jf jf G (|) G (|) (e 2m ^-^/a, b («)^>) - e w <^>/«>)/c>)) ^ 
+ Q / anVM) + sin 2 (7rt(x-&)) + sin 2 (7rt(y - c)) + sin 2 (7r% - d)) 



(7rt(a:-a)) 2 (7rt(a; - b)) 2 (irt(y - c)) 2 (vrt(y - d)) 2 
In order to prove Theorem 1 we shall require the following lemma: 

Lemma 4.2. Let t be a large positive real number. Then, uniformly for all real numbers 
a < b we have 

Proof. Let X be a standard Gaussian random variable. Since ~E(e ttx ) = e~* 2 / 2 , then 

On the other hand we have 

-L fe- x2/2 dx = P(X e [a,b]) = E(l a>b (X)). 
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Therefore, it follows from (4.2) that 

JO Vt/ U y/ZTTJa 

^ E f sm 2 (nt(X - a)) + sin 2 (7rt(X - b)) 



(nt(X - a)) 2 (7rt(X - b)) 2 
To bound the RHS of the last inequality we use the following identity: 
, A . sin 2 (7rte) 2(1 - cos(27rtx)) 2 /"* , 

Let / be a real number. Then (4.5) yields 

'(w) = >f (1 - ,)rtr) * 

= 4 / (* - cos(27r/t;)e- (2 ^ )2/2 ^ < -. 
t Jo t 

This concludes the proof. □ 

We are now ready to prove Theorem 1. 

Proof of Theorem 1. Let iV < log q/ (20 log H) be a positive integer and 1 < t < 
min(iJ 1 / 4 , N) be a real number to be chosen later. Then, using (4.4) we deduce that 

\ Ex=o M^xO*)) equals 
(4-6) 

^Re jf jf G (|) G (|) ($ x (2™, -2™)/ a , b (^;^>) - $ x (2™, 2™)^^)/,,^)) ^ 

+ O (/ x (t, a) + I x (t, b) + J x (t, c) + J x (t, d)) , 
where 

7 (t 1 ^ sin 2 (vrt(ReS x (:r) - Q) 
X ' 9^ (7rt(Ite5 x (x) - I)) 2 ' 

and 

J (t 1 ^ sin 2 (^(Im^^) - Q) 
Xl ' ^ (7rt(Im5 x (x)-/)) 2 
We begin by estimating the main term in (4.6). First note that (4.1) implies 

< n ^ 2{n) 

uv 

Therefore, using this inequality and appealing to Theorem 3.1 we find that the integral 
in (4.6) equals 

W* jf (t) g (f ) - (-®=^) - /-WW.)) 
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Now, using (4.3) along with Lemma 4.2 we deduce that the main term of (4.7) equals 



(4.8) 



1'/«(t) g (D-(^)'<'™77 



= 2,11 6XP dUdV + ° 



Next, we bound the contribution of the error term in (4.6). Using the identity (4.5) we 
derive 

9-1 rt 



W) = - Q Y1 I ^V^ cos (l™(ReS x {x)-l))dv 

= ^ Re E f Q 2 ^~ V K - 2 ™ 1 exp (27riuReSj(x)) dv 

= Re f 2 ^~ V K - 2 ^ x (27rv, 0)dv. 
Jo t 



Hence, it follows from Theorem 3.1 that 

I, (t, e- (2 "" 2/2 ( 1 + 0(£))dv + (i*tf (± + 

A similar bound for J x (t, I) can be obtained along the same lines. Therefore, combining 
the estimates (4.6)-(4.9) we deduce 

\ E M^)) = /X exp ("^-p") dxdy + 0( ^ 2) ' 

where 

, ,„s \ ( 1 (4vrt) 2JV+2 (47rt) 4JV+2 1/4//( , 4iV+2 \ 



We choose 



^ ' 607ry log/7 / ' 



and AT = [(87rf) 2 ]. Hence, Stirling's formula yields 

(47Tt) 2jV+2 (47Tt) 4iV+2 1_ 

TV! + (2AT)! ^t 2 ' 

which implies 



£ 2 « (^(%) + 1) (#- 1/4 + + q - l '\^tr^ 

Finally, noting that 

g- 1 / 4 (47Tt)( 20 ^ 2 < 5 -l/4( 47r ^)log,/(91ogH) <<; 5 -l/ 8> 
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concludes the proof. □ 
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